The Fibers and Range of Reduction Graphs in 

Ciliates 



Robert Brijder and Hendrik Jan Hoogeboom 

Leiden Institute of Advanced Computer Science, Universiteit Leiden, 
Niels Bohrweg 1, 2333 CA Leiden, The Netherlands, 
rbrijder@liacs.nl 

February 1, 2008 

Abstract 

The biological process of gene assembly has been modeled based on 
three types of string rewriting rules, called string pointer rules, defined on 
so-called legal strings. It has been shown that reduction graphs, graphs 
that are based on the notion of breakpoint graph in the theory of sorting by 
reversal, for legal strings provide valuable insights into the gene assembly 
process. We characterize which legal strings obtain the same reduction 
graph (up to isomorphism), and moreover we characterize which graphs 
are (isomorphic to) reduction graphs. 

More formally, let 1Z be the function which assigns to each legal string 
u its reduction graph 1Z U . We characterize the fiber TiT^iTZu) (modulo 
graph isomorphism) for each reduction graph 1Z U - In fact we show that 
1Z~ (1Z U ) is the 'orbit' of u under two types of string rewriting rules, 
which are in a way dual to two of the three types of string pointer rules. 
We also characterize the range of 1Z in terms of easy-to-check conditions 
on graphs. 

1 Introduction 

Ciliates form a large group of one-cellular organisms that are able to transform 
one nucleus, called the micronucleus, into an astonishing different one, called the 
macronucleus. This intricate DNA transformation process is called gene assem- 
bly. Each gene in the micronucleus, called micronuclear gene, is transformed to 
a gene in the macronucleus, called macronuclear gene. The string pointer reduc- 
tion system models gene assembly based on three types of string rewriting rules, 
called string pointer rules, defined on so-called legal strings [6 . In this model, 
a micronuclear gene is represented by a legal string u, while its macronuclear 
gene (with its waste products) is represented by the reduction graph of u 0] . 
The reduction graph is based on the notion of breakpoint graph in the theory 
of sorting by reversal [H [TJ [TU] . Q 
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In this paper we characterize which graphs are (isomorphic to) reduction graphs 
(cf. Theorem [2~4"|) . Obviously, these graphs should have the 'look and feel' of re- 
duction graphs. For instance, each vertex label should occur exactly four times, 
and the second type of edges connect vertices of the same label. Once these el- 
ementary properties are satisfied, reduction graphs are characterized as having 
a connected pointer-component graph — a graph which represents the distribu- 
tion of the vertex labels over the connected components. The characterization 
corresponds to an efficient algorithm. In this way we obtain a restriction on the 
form of the macronuclear structures that can possibly occur. We also provide a 
characterization that determines, given two legal strings, whether or not they 
have the same reduction graph (cf. Theorem I34|). This may allow one to deter- 
mine which micronuclear genes obtain the same macronuclear structure. It turns 
out that two legal strings obtain the same reduction graph (up to isomorphism) 
exactly when they can be transformed into each other by two types of string 
rewriting rules, which surprisingly are in a sense dual to the string positive rules 
and the string double rules (two of the three types of string pointer rules). 

The latter characterization has other uses as well. In a sense, the reduction graph 
allows for a complete characterization of applicability of string negative rules, the 
other type of string pointer rules, during the transformation process EH [21 H]- 
Moreover, it has been shown that the reduction graph does not retain much 
information about the applicability of the other two types of rules [3]. Therefore, 
the legal strings that obtain the same reduction graph are exactly the legal 
strings that have similar characteristics concerning the string negative rule. 

To establish both main results, we augment the (abstract) reduction graph with 
a set of merge-legal edges. We will show that some "valid" sets of merge-legal 
edges for a reduction graph allows one to "go back" to a legal string correspond- 
ing to this (abstract) reduction graph. In this way the existence of such valid set 
determines which graphs are (isomorphic to) reduction graphs. The first main 
result shows that the existence of such valid set is computationally easy to verify. 
Moreover, the set of all sets of merge-legal edges can be transformed into each 
other by flip operations. These flip operations can be defined in terms of the 
above mentioned dual string pointer rules on legal strings. This will establish 
the other main result. 

This paper is organized as follows. Section [H fixes notation of basic mathe- 
matical notions. In Section [3] we recall the string pointer reduction system, in 
Section [4] we recall the reduction graph and the pointer-component graph, and 
in Section [5] we generalize the notion of reduction graph and give an extension 
through merge-legal edges. In Section [S] we provide a preliminary characteriza- 
tion that determines which graphs are (isomorphic to) reduction graphs. In the 
next three sections, we strengthen the result to allow for efficient algorithms: in 
Section [7] we define the flip operation on sets of merge-legal edges, in Section [8] 
we show that the effect of flip operation corresponds to merging or splitting of 
connected components, and in Section [5] we prove the first main result, cf. The- 
orem In Sections [TU1 and [TT1 we prove the second main result, cf. Theorem [3"H 
We conclude this paper with a discussion. 
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2 Mathematical Notation and Terminology 



In this section we recall some basic notions concerning functions, strings, and 
graphs. We do this mainly to fix the basic notation and terminology. 

The symmetric difference of sets X and Y, (X\Y) U (Y\X), is denoted by 
X © Y. The symmetric difference of a finite family of sets (Xi)i e A is denoted 
by Q) ieA Xi. The composition of functions / : X — > Y and g : Y — > Z is 
the function gf : X — > Z such that (gf)(x) — g(f(x)) for every x G X. The 
restriction of / to a subset A of X is denoted by f\A, . The range f(X) of / 
will be denoted by rng(/). We define for y 6 Y, f^ 1 (y) = {x £ X \ f(x) = y}. 
If Y = X, then / is called self-inverse if f 2 is the identity function. We will use 
A to denote the empty string. 

We now turn to graphs. A (undirected) graph is a tuple G = (V, E), where V is 
a finite set and E C {{x, y} | x, y G V^}. The elements of V are the vertices of 
G and the elements of E are the edges of G. In this paper we allow x = y, and 
therefore edges can be of the form {x, x} = {x} — an edge of this form should 
be seen as an edge connecting x to x, i.e., a 'loop' for x. The restriction of G to 
E' C E, denoted by G\ E >, is (V, E'). The order \V\ of G is denoted by o{G). 

A multigraph is a (undirected) graph G = (V, E, e), where parallel edges are 
possible. Therefore, E 1 is a finite set of edges and e : E — » {{x, y} | x, y G y} is 
the endpoint mapping. 

A coloured base B is a 4-tuple (V,f,s,t) such that is a finite set, s,t G V, 
and / : V\{s, £} — > T for some I\ The elements of 1/, {{x, y} | x, y G V, x 7^ y}, 
and r are called vertices, edges, and vertex labels for £?, respectively. 

A n-edge coloured graph, n > 1, is a tuple G = (V, £1, -E2, • ■ • , E n , f, s, t) where 
B = (V, f, s, t) is a coloured base and, for i G {I, . . . , n}, E t is a set of edges for 
B. We also denote G by B(E 1 ,E 2 , ■■■ , E n ). We define dom(G) = rng(/). 

The previously defined notions and notation for graphs carry over to multigraphs 
and n-edge coloured graphs. Isomorphisms between graphs are defined in the 
usual way: they are considered isomorphic when they are equal modulo the 
identity of the vertices. Thus, multigraphs G = (V, E, e) and G' — (V',E,e') 
are isomorphic if there is a bijection a : V — > V such that ae — e', or more 
precisely, for e G E, e(e) = {fi,^} implies e'(e) = {a(vi), a(t>2)}. We assume 
the reader is familiar with the notions of cycle and connected component in a 
graph. A graph is called connected if it has exactly one connected component, 
and it is called acyclic when it does not contain cycles. 

3 String Pointer Reduction System 

The string pointer reduction system is the model of gene assembly that is used in 
this paper. In this section we give a concise description of this system, omitting 
examples and motivation. We refer to [7] for an in-depth description of this 
model including motivation and examples. 

We fix k > 2, and define the alphabet A = {2, 3, ... , k}. For D C A, we define 
D = {a I a G D} and II — A U A. The elements of II will be called pointers. 
We use the 'bar operator' to move from A to A and back from A to A. Hence, 
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for p G II, p — p. For a string u — x\Xi ■ ■ ■ x n with Xi G IT, the inverse of u is 

{p ifpgA 
_ , i.e., p is 
p if p G A 

the 'unbarred' variant of p. The domain of a string u £ II* is dom(v) = {p | 
p occurs in w}. A legal string is a string u € II* such that for each p G II that 
occurs in u, it contains exactly two occurrences from {p,p}. For a pointer p and 
a legal string u, if both p and p occur in u then we say that both p and p are 
positive in it; if on the other hand only p or only p occurs in u, then both p and 
p are negative in ?i. 

Let it = £12:2 ' ' ' x n be a legal string with x% G II for 1 < i < n. For a pointer 
pell such that {2^,2^} C {p,p} and 1 < i < j < n, the p-interval of u is 
the substring XiXi+i ■ ■ ■ Xj. Two distinct pointers p, q G II overlap in u if both 
q G dom(Ip) and p G dom{I q ), where 7 p resp.) is the p-interval (g-interval, 
resp.) of 11. 

The string pointer reduction system consists of three types of reduction rules, 
called string pointer rules, operating on legal strings. In this paper we will not 
consider these rules directly, but rather study the reduction graph (which is 
recalled in the next section) that captures essential properties of the rewriting 
system. For completeness we list the rules. For all p, q G LI with p^q: 

• the string negative rule for p is defined by sxir p (u\ppu2) = U1U2, 

• the string positive rule for p is defined by spr p (itipii2pit3) = 111U2U3, 

• the string double rule for p, q is defined by s&T p q (uipu2qu3pu<tqu5) = 

where u\, 112, ■ ■ • ,itg are arbitrary (possibly empty) strings over II. 

We say that legal strings 11 and v are equivalent, denoted by u k, v, if there is 
homomorphism ip : II* — > II* with ip(p) £ {p,p} and <p(p) = ip(p) for all p G II 
such that <p(u) = v. 

Example 1 

Legal strings 2233 and 2233 are equivalent, while 2233 are 2233 are not. 

Note that « is an equivalence relation. Equivalent legal strings are characterized 
by their 'unbarred version' and their set of positive pointers. 

The domain of a reduction rule p, denoted by dom(p), is defined by dom(snr p ) = 
dom(spr p ) = {p} and dom(sdr Pig ) = {p,q} for p, q G II. For a composition 
(p = p n ■■■ P2 pi of reduction rules p\, p2, ■ ■ ■ , p n , the domain, denoted by 
dom(ip), is dom(pi) U dom(p2) U • • • U dom(p„). 

A composition if of reduction rules is called a reduction. Let it be a legal string. 
We say that ip is a reduction of u, if ip is a reduction and ip is applicable to 
(defined on) u. A successful reduction ip of u is a reduction of u such that 
p{u) = A. We then also say that ip is successful for u. For every legal string 
there exists a successful reduction, which in general is not unique [7]. 
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Figure 1: The reduction graph 1Z U of u in Example [5] 

4 Reduction Graph 

We now recall the definition of reduction graph. This definition is equal to the 
one in [3J, and is in slightly less general form compared to the one in [5]. We 
refer to [S], where it was introduced, for a motivation and for more examples 
and results. The notion of reduction graph uses the intuition from the notion of 
breakpoint graph (or reality-and-desire diagram) known from another branch of 
DNA processing theory called sorting by reversal, see e.g. [10] and [9]. From a 
biological point of view, the reduction graph represents the macronuclear form 
of a gene given its micronuclear form. This micronuclear form of the gene is 
represented by a legal string, and therefore reduction graphs are defined on 
legal strings. 

Definition 1 

Let u = P1P2 ■ ■ ■ p n with p\ , . . . , p n € II be a legal string. The reduction graph 
of u, denoted by TZ U , is a 2-edge coloured graph (V, E\, E 2 , /, s,t), where 

v = {h,h,...,i n } u {/(,4,. ..,i;> u { s ,t}, 

Ei = {e ,ei, . . . ,e„} with e t = for 1 < i < n, e = {s,Ix},e n = {l' n ,t}, 

E 2 = {{I<, Ij}, {It, Ifi {1,2,..., n} withi =£j and pi = Pj } U 
{/■, Ij} I i,j 6 {1,2, ...,n} andp 4 =pj}, and 

/(/,) = f(Il) = Pl for 1 < i < n. 

m 

The edges of E\ are called the reality edges, and the edges of E 2 are called the 
desire edges. Notice that for each p £ dom(u), the reduction graph of u has 
exactly two desire edges containing vertices labelled by p. It follows from the 
construction of the reduction graph that, given legal strings u and v, u w v 
implies that 1Z U « TZ V . 

In depictions of reduction graphs, we will represent the vertices (except for s 
and t) by their labels, because the exact identity of the vertices is not essential 
for the problems considered in this paper. We will also depict reality edges as 
'double edges' to distinguish them from the desire edges. 

Example 2 

The reduction graph of u = 274735342656 is depicted in Figure [TJ Note how 
positive pointers are connected by crossing desire edges, while those for negative 
pointers are parallel. By rearranging the vertices we can depict the graph as 
shown in Figured) 
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s^=2 2^=6 6^=t 6 6 



2^=7 7 7 7^=3 5 5 

2^=4 4 4 4^=3 5 5 

3 3 

Figure 2: The reduction graph of Figure [1] obtained by rearranging the vertices. 




R 



3 2 

4 

C*2 C3 

7 

Figure 3: The pointer-component graph of the reduction graph from Figured] 

Reality edges follow the linear order of the legal string, whereas desire edges 
connect positions in the string that will be joined when performing reduction 
rules, see [5]. 

We now recall the definition of pointer-component graph of a legal string, in- 
troduced in [3]. Surprisingly however, this graph has different uses in this paper 
compared to its original uses in [3] , where it is used to characterize which string 
negative rules are used in successful reductions of the legal string. 

Definition 2 

Let u be a legal string. The pointer- component graph of u (or oflZ u ), denoted 
by VC U , is a multigraph (£, E, e), where £ is the set of connected components 
of 7Z U , E = dom(w) and e is, for e S E, defined by e(e) = {C 6 ( \ C contains 
vertices labelled by e}. ■ 

Example 3 

The pointer-component graph of the reduction graph from Figure [2] is shown in 
Figure El 

5 Abstract Reduction Graphs and Extensions 

In this section we generalize the notion of reduction graph as a starting point 
to consider which graphs are (isomorphic to) reduction graphs. Moreover, we 
extend the reduction graphs by a set of edges, called merge edges, such that, 
along with the reality edges, the linear structure of the legal string is preserved 
in the graph. 
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5 5^=9 8 8 
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5 5 4 4^=9 7 7 9 



2 2 3 3^=8 8^=3 3 9 



4 4 7 7^=6 6^=6 6 t 

Figure 4: An abstract reduction graph. 

We will now define a set of edges for a given coloured base which has features 
in common with desire edges of a reduction graph. 

Definition 3 

Let B = (V, /, s, t) be a coloured base. We say that a set of edges E for B is 
desirable if 

1. for all {v u v 2 } G J5, /M = /(v 2 ), 

2. for each u G V\{s, t} there is exactly one e G E such that w G e. ■ 

We now generalize the concept of reduction graph. 
Definition 4 

A 2-edge coloured graph B(Ei,E%) with B = (V,f,s,t) is called an abstract 
reduction graph if 

1. rng(/) C A, and for each p G rng(/), \f~ 1 (p)\ = 4, 

2. for each v G V there is exactly one e € E\ such that v G e, 

3. £2 is desirable for B. ■ 

The set of all abstract reduction graphs is denoted by Q. 

Clearly, if G ss 1Z U for some it, then G e Q. Therefore, for abstract reduction 
graphs G = B(E%, E2), the edges in E\ are called reality edges and the edges in 
E2 are called desire edges. For graphical depictions of abstract reduction graphs 
we will use the same conventions as we have for reduction graphs. Thus, edges 
in Ei will be depicted as "double edges" , vertices are represented by their label, 
etc. 

Example 4 

The 2-edge coloured graph in Figure |4] is an abstract reduction graph. 

Note that conditions (1) and (3) in the previous definition imply that for each 
p G rng(/), there is a partition {ei, e 2 } of f~ 1 (p), denoted by Cg, p or C p when 
G is clear from the context, such that e\, e 2 G £2- 

We now introduce an extension to reduction graphs such that the 'generic' linear 
order of the vertices s, 7i, I' n , t is retained, even when we consider the 

graphs up to isomorphism. 
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S = 2— 2 = 7r-7 = 4 — 4 = 7-;7 = 3--3 = 5;-5 = 3--3 = 4--4 = 2 — 2 = 6--6 = 5-^5 = 6--6 = t 



Figure 5: The extended reduction graph £ u of u given in Example [21 
Definition 5 

Let u be a legal string. The extended reduction graph of u, denoted by £ u , is a 3- 
edge coloured graph B(Ei,E 2 ,E 3 ), where 1Z U — B{E\,E 2 ) and E 3 = {{Ij, I-} \ 



The edges in E3 are called the merge edges of u, denoted by M u . In this way, 
the reality edges and the merge edges form a unique path which passes through 
the vertices in the generic linear order. This is illustrated in the next example. 
In figures merge edges will be depicted by "dashed edges" . 

Example 5 

The extended reduction graph £ u of u given in Example [2] is shown in Figure [HI 
cf. Figure [U 



The notion of merge edges for (extended) reduction graphs is more closely re- 
lated to the notion of reality edges for breakpoint graphs in the theory of sorting- 
by-reversal compared to the notion of reality edges for (extended) reduction 
graphs. Thus in a way it would be more natural to call the merge edges reality 
edges for (extended) reduction graphs, and the other way around. However, to 
avoid confusion with earlier work, we do not change this terminology. 

We now generalize this extension of reduction graphs to abstract reduction 
graphs. 

Definition 6 

Let G = B(Ei 1 E 2 ) G G, and let £ be a set of edges for B. We say that E 
is merge-legal for G if E is desirable for B, and E 2 n E = 0. We denote the 
set {E I E merge-legal for G} by log- The set of all E G log where B(E\,E) is 
connected is denoted by 9c- ■ 

For legal string u, we also denote lo-r u and 9-ji u by lo u and 9 U , respectively. 

Notice that M u G 9 U C lo u . Therefore, merge-legal edges will also be depicted 
by "dashed edges" . 

Example 6 

Let us consider the abstract reduction graph G = B{E\,E 2 ) of Figure [5] This 
graph is again depicted in Figure [7] including a merge-legal set E for G. In 
this way Figure [7] depicts the 3-edge coloured graph B[E\,E 2 ,E). Notice that 
E $ 6c- In Figure the abstract reduction graph is depicted with a merge-legal 
set in 9c- 

We now define a natural abstraction of the notion of extended reduction graph. 



1 < % < n} with n = \u\. 



Remark 
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s^=2 2^=3 3^=i 

2^=3 
2^=3 

Figure 6: An abstract reduction graph. 
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Figure 7: The abstract reduction graph of Figure El with a set of merge-legal 
edges. 
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Figure 8: The abstract reduction graph of Figure El with another set of merge- 
legal edges. 
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6 6^=t 6 6 



2^=7 7 7 7^=3 
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\ / 




/ \ 
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/ \ 



3 \ 5 

\ 



Figure 9: A extended abstract reduction graph obtained by augmenting the 
reduction graph of Figure [2] with merge edges. 



Definition 7 

Let G = B{E 1 ,E 2 ) G Q and E G 6 G . Then G' = B(E ll E 2l E) is called a 
extended abstract reduction graph. m 

For each legal string it, £ u is an extended abstract reduction graph, since M u G 
# u . Therefore, the edges in E (in the previous definition) are called the merge 
edges (of G'). Since E £ 6q, B[E\,E) has the following form: 

S = Pl Pl = p 2 P2 = ■ ■ ■ = Pn Pn = t 

Thus the property that reality and merge edges in an extended reduction graph 
induce a unique path from s to t that alternatingly passes through reality edges 
and merge edges is retained for extended abstract reduction graphs G in general. 

Example 7 

If we consider the reduction graph 1Z U — B(E\,E2) of Example [U shown in 
Figured! then, of course, B{E\, E 2 , M u ) — £ u shown in Figured] is a extended 
abstract reduction graph. In Figure[9]another extended reduction graph is shown 
- it is TZ U augmented with a set of merge edges E in 6 U . It is easy to see that 
indeed E G 9 U : simply notice that the path from s to t induced by the reality 
and merge edges will go through every vertex of the graph. 



6 Back to Legal Strings 

In this section we show that for extended abstract reduction graphs G we can 
'go back' in the sense that there are legal strings u such that G is isomorphic 
to £ u . Moreover we show how to obtain the set La of all legal strings that 
corresponds to G. We will show that the legal strings in Lq are equivalent, and 
thus that extended reduction graphs retain all essential information of the legal 
strings. 
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Figure 10: The extended abstract reduction graph G given in Example [5] 



As extended abstract reduction graphs have a natural linear order of the vertices 
given by their reality edges and merge edges, we can infer whether or not desire 
edges 'cross' or not. Thereby providing a way to define negative and positive 
pointers for extended abstract reduction graphs. 

Definition 8 

Let G = B(E±, E2, -B3) be an extended abstract reduction graph, let G' — 
B(Ei, E2), and let 7r = [s, v\, v[, ■ ■ ■ ,v n ,v' n ,t) be the path from s to t in 
B(E 1 ,E 3 ). We say that p G dom(G) is negative in G iffCa>,p — v'A, {v' i: Vj}} 
for some i,j G {1, . . . ,n} with i ^ j. Also, we say that p G dom(G) is positive 
in G if p is not negative in G. m 

Clearly, p G dom(G) is positive in G iff Cg', p — {{vi, Vj}, {v[, v'^}} for some 
i, j G {1, . . . , n} with i ^ j. It is easy to see that p is negative in legal string u 
iff p is negative in £ u . 

The next definition defines a set of legal strings for each extended abstract 
reduction graph. 

Definition 9 

Let G = B(E\, E2, E3) be an extended abstract reduction graph, let G' = 
B(E 1 ,E 2 ), and let H = B{E U E 3 ) be as follows: 

S = Pl Pl = P2 P2 = ■ ■ ■ = Pn Pn = t 

The legalization of G, denoted by Lq 1 is the set of legal strings u — p\Pi • • ■ Pn 
with pi G {p^ Pi} and pi is negative in u iff pi is negative in G. u 

Example 8 

Let us consider the extended abstract reduction graph G of Figure [HI By re- 
arranging the vertices we obtain Figure 1101 From this figure it is clear that 
v = 27488265374356 G L G . 

It is easy to see that, for a legal string u, we have u G Lg u . 

Note that Lq, for extended abstract reduction graph G, is an non-empty equiv- 
alence class w.r.t. to the « relation (for legal strings). Since the definition of Lq 
does not depend on the exact identity of the vertices of G, we have, for extended 
abstract reduction graphs G and G', G « G' implies Lq = Lq* . 

Theorem 10 

1. Let G and G' be extended abstract reduction graphs. Then G ~ G' iff 
Lg = Lg<- 

2. Let u and v be legal strings. Then u ^ v iS £ u ~ £ v . 
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Proof 

We first consider statement 1. We have already established the forward impli- 
cation. We now prove the reverse implication. Let G — B(E\, E 2 , E 3 ), G' = 
B'(E[, E' 2 , E' 3 ), and Lq — Lq>- By the definition of legalization, B(Ei,E 3 ) « 
B'(E[,E' 3 ) and p is negative in G iff p is negative in G' for p G dom(G) = 
dom(G'). Therefore, G « G' . 

We now consider statement 2. We have u « v iff u, v G Ls u — Lg v (since 
legalizations are equivalence classes of legal strings w.r.t w) iff £ u k, £ v (by the 
first statement). ■ 

Let G be an extended abstract reduction graph, and take u G Lq (such a u 
exists since Lq is nonempty). Since u £ Lg u and legalizations are equivalence 
classes, we have Lg u = Lq and therefore G ~ £ u . Thus every extended abstract 
reduction graph G is isomorphic to an extended reduction graph. In fact, it 
is isomorphic to precisely those extended reduction graphs £ u with u G Lq. 
Therefore, this u is unique up to equivalence. 

Corollary 11 

Let u and v be legal strings. If 1Z U ~ 1Z V , then there is a E 1 G 9 U such that 
£, w B(E 1 ,E 2 ,E) with 7e u = B(E 1 ,E 2 ). 

Proof 

Since 7?. u ~ T^, there is an set of edges -E for 7?.„ such that w B(E%, E 2 ,E). 
Since G 6*„, we have E £ 9 U . m 

We end this section with a graph theoretical characterization of reduction graphs. 
Theorem 12 

Let G be a 2-edge coloured graph. Then G is isomorphic to a reduction graph 
iff G G G and 6 G ^ 0. 

Proof 

Let G « 7£ u for some legal string u. Then clearly, G G 5- Also, M n G # u and 
hence 9 U ^ 0. Therefore, 9q ^ 0. 

Let E E 9q. Then G" = B(E\, E 2 , E) is an extended abstract reduction graph 
with G = B(Ei,E 2 ). By the paragraph below Theorem fTUl G' « £ u for some 
legal string it (take u G Lq>). Hence, G « 7?. u . ■ 



7 Flip Edges 

In this section and the next two we provide characterizations of the statement 
9q 0. This allows, using Theorem Q21 for a characterization that corresponds 
to an efficient algorithm that determines whether or not a given G G Q is 
isomorphic to a reduction graph. Moreover, it allows for an efficient algorithm 
that determines a legal string u for which G ~ 1Z U . 

Let G G Q. Then a merge- legal set for G is easily obtained. For each p G dom(G) 
with C p — V2}, {«3, ^4}}, a merge-legal set for G must have either the 

edges {vi,«3} and {v2,V4} or the edges {^1,^4} and {^2,^3}, see both sides in 
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v i v 3 vi v 3 

\ / 

<-> ^ / 

/ \ 

/ \ 

V 2 l>4 1>2 «4 



Figure 1 1 : Flip operation for p. All vertices are labelled by p 
s^=2 2^=2 2^=i 



3^=3 



3^=3 

Figure 12: An abstract reduction graph G for which Qq = 0. 

Figure[TlJ By assigning such edges for each p 6 dom(G) we obtain a merge-legal 
set for G. Thus, log ^ for each G £ Q. Note that in particular, if dom(G) = 0, 
then log — {0}- However, Qq can be empty as the next example will illustrate. 

Example 9 

It is easy to see that the abstract reduction graph G of Figure [12] does not have 
a merge-legal set in 8q- 

We now formally define a type of operation that in Figure [TT] transforms the 
situation on the left-hand side to the situation on the right-hand side, and the 
other way around. Informally speaking it "flips" edges of merge-legal sets. 

Definition 13 

Let G = B(Ei,E2) 6 G, let / be the vertex labeling function of G, and let 
p G dom(G). The flip operation for p (w.r.t. G), denoted by flip G p , is the 
function loq — > luq defined by: 

flip GiP (£0 - {{v!,v 2 } e E | f( Vl ) +p + f(v 2 )} U {e x , e 2 }, 

where e\ and e 2 are the two edges with vertices labelled by p such that e\, e 2 
E 2 UE. m 

When G is clear from the context, we also denote flipc p by flip p . 

Since by Figure [TT1 there are exactly two edges e\ and e 2 with vertices labelled 
by p that are not parallel to both the edges in E 2 U E, flip p is well defined. It is 
now easy to see that indeed flip p (£') e log for E 6 log- 

Example 10 

Let G be the abstract reduction graph of Figure [H If we apply flip G 2 to the set 
of merge-legal edges depicted in Figure [3 then we obtain the set of merge-legal 
edges depicted in Figure [51 
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G 2 R G 5 



Figure 13: The pointer-component graph of the abstract reduction graph from 
Figure HI 

The next theorem follows directly from the previous definition and from the fact 
that Figure[TT]contains the only possible ways in which edges in merge-legal sets 
for G can be connected. 
Theorem 14 

Let G £ G, and denote by T be the group generated by the flip operations w.r.t. 
G under function composition. Then each element of T is self-inverse, thus J- 
is Abelian, and T acts transitively on wg. 

Let D = {pi, . . . ,p{\ C dom(G). Then we define flip^ = fhp p , ■ ■ • fhp Pl - Since 
T is Abelian, flip^ is well defined. Moreover, since each each element in T is self- 
inverse, T = {flip^, | D C dom(G)}. Also, if D U D 2 C dom(G) and D x ^ D 2 , 
then mp Di (E) ^ flip c ,,( J B). Thus the following holds. 

Theorem 15 

Let G eg. Then there is a bijection Q : 2 dom ( G ) -> T given by Q{D) = flip D . 
Moreover, for each E E ujq, l^g = {^Pd(^) I D ^= dom(G)}. 

8 Merging and Splitting Connected Components 

Let G = B(Ei,E2) be an abstract reduction graph and let E £ u>q. In this 
section we consider the effect of the flip operation on the pointer-component 
graph defined on the abstract reduction graph H = B(Ei,E). If we are able 
to obtain, using flip operations, a pointer-component graph consisting of one 
vertex, then flg ^ 0, and consequently by Theorem I12| G is isomorphic to a 
reduction graph. 

However, first we need to define the notion of pointer-component graph for 
abstract reduction graphs in general. Fortunately, this generalization is trivial. 

Definition 16 

Let G £ Q. The pointer- component graph ofG, denoted by VCq, is a multigraph 
(C, E, e), where C is the set of connected components of G, E — dom(G), and e 
is, for e G E, defined by e(e) = {G £ C | G contains vertices labelled by e}. ■ 

Example 11 

The pointer-component graph of the graph from Figure [4] is shown in Figure [13] 

Note that when G = B(Ei,E 2 ) £ Q and E £ ujq, then E is desirable for B. 
Hence, H = B(E\, E) is also an abstract reduction graph. Therefore, e.g., VCh 
is defined. 
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It is useful to distinguish the pointers that form loops in the pointer-component 
graph. Therefore, we define, for G £ G, bridge(G) = {e £ E |e(e) | = 2} 
where VCq — (V,E,e). In [3J, bridge(G) is denoted as snrdom(G). However, 
this notation does not make sense for its uses in this paper. 

Example 12 

From Figure [TS] it follows that bridge(G) = dom(G)\{3, 6} for the abstract 
reduction graph G depicted in Figured! 

Merge rules have been used for multigraphs, and pointer-component graphs in 
particular in [3J . The definition presented here is slightly different from the one 
in [3J - here the pointer p on which the merge rule is applied remains present 
after the rule is applied. 

Definition 17 

For each edge p, the p-merge rule, denoted by merge p , is a rule applicable to 
(defined on) multigraphs G = (V, E, e) with p £ bridge(G). It is defined by 

merge p (G) = (V',E,e'), 

where V = (V\e(p)) U {v'} with v' V, and e'(e) = {h( Vl ) , h(v 2 )} iff e(e) = 
{vi,v 2 } where h(v) = v' if v £ e(p), otherwise it is the identity. ■ 

It is easy to see that merge rules commute. We are now ready to state the 
following result which is similar to Theorem 27 in [3J. 

Theorem 18 

Let G = B(Ei,E 2 ) £ Q, let E £ ui G , let H = B(E 1 ,E), and let, for p £ dom(G), 
H p = B(E 1 ,m Jip (E)). 

• If p £ bridge(if), then VCh p ~ merge p {VCh) 
(and therefore o(VC Hp ) = o(VC H ) - 1). 

• If p £ dom( J ff)\bridge( J ff), then o(TC H ) < o(TC Hp ) < o(VC H ) + 1. 
Proof 

First let p £ bridge(if). Let Ch, p = {{vi, V2}, {^3, V4}}. Then, H has the fol- 
lowing form, where each of the two edges in C'h. p are from different connected 
components in H and where, unlike our convention, we have depicted the ver- 
tices by their identity instead of their label: 

• ■ ■ = vi i> 2 = ■ ■ ■ 



■ ■ ■ = V 3 Vi = ■ ■ ■ 

Now, either {{vi, W4}, {v 2 , v 3 }} C E 2 or {{vi,v 3 }, {v 2 , W4}} C E 2 . Thus H p is of 
either 

• • • = Vi 

1 
1 
I 

• • ■ = v 3 



i'2 
I 
I 

Vi 
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or 



Vi v 2 

\ / 

\ / 

/ \ 
/ \ 
V Z Vi 



form, respectively. Thus in both cases, the two connected components are merged, 
and thus VCh p can be obtained (up to isomorphism) from VCh by applying 
the merge p operation. 

Now let p 6 dom(i/)\bridge(if). Then the edges in Ch, p belong to the same 
connected component. Thus H has the following form 

• • • = V! V 2 = ■ ■ ■ = V 3 Vi = ■ ■ ■ 

where C H , P = {{"i, ^2}, {"3, ^4}}- Again, either {{vi, w 4 }, {v 2 , v 3 }} C E 2 or 
{{vi, V3}, {v 2 , V4}} C E 2 . Thus Hp is of either 



vi 



C 2 



''3 



or 



v\ v 2 = ■ ■ ■ = v 3 Vi 



form, respectively. Thus, H p has either the same number of connected com- 
ponents of H or exactly one more, respectively. Thus, o(PCh) < o(VCh ) < 
o(VC H ) + l. " m 

Example 13 

Let G = B(E\, E 2 ) € Q be as in Figure [Sj If we take E £ loq as in Figure [71 then 
2 G bridge^) with H = B{E 1 ,E). Therefore, by Theorem US and the fact that 
G has exactly two connected components, H 2 — B(Ei,Qip 2 (E)) is a connected 
graph. Indeed, this is clear from Figure [5] (by ignoring the edges from E 2 ). 



Informally, the next lemma shows that by applying flip operations, we can shrink 
a connected pointer-component graph to a single vertex. In this way, the under- 
lying abstract reduction graph is a connected graph. 

Remark 

The next lemma appears to be similar to Lemma 29 in [3]. Although the flip 
operation (defined on graphs) and the rem operation (defined on strings) are 
quite distinct, they do have a similar effect on the pointer-component graph. 

Lemma 19 

Let G = B{E 1 ,E 2 ) £ G, let E E lo g , let H = B{E U E), and let D C dom(G) = 
dom(-ff). Then TC h \d is a tree iff B(E U fhp D (£T)) and H have 1 and \D\ + 1 
connected components, respectively. 
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Proof 

Let D = {pi, . . . ,p n }- We first prove the forward implication. If VCh \d is a tree, 
then it has \D\ edges, and thus \D\ + 1 vertices. Therefore, VCh has \D\ + 1 
vertices, and consequently, H has \D\ + 1 connected components. Since VCh\d 
is acyclic, by Theorem IT51 

VC b{EuSXVd{e)) =VC B{El:{fiiPpn ... m Ppi )(E)) ~ (merge pre ••• merge pi ){VC H )- 

Now, applying \D\ merge operations on a graph with \D\ + 1 vertices, results in 
a graph containing exactly one vertex. Thus B(E\, ftip D (E)) has one connected 
component. 

We now prove the reverse implication. Moving from H — B(Ei, E) to B{E\,Qip D (E)) 
reduces the number of connected components in \D\ steps from \D\ + 1 to 1. 
By Theorem 1181 each flip operation of fhp D corresponds to a merge operation. 
Therefore (merge pii ■•• merge pi ) is applicable to VCh- Consequently, VCh\d 
is acyclic. Since this graph has |D| + 1 vertices, VC b \d is a tree. ■ 

9 Connectedness of Pointer-Component Graph 

In this section we use the results of the previous two sections to prove our 
first main result, cf. Theorem 1241 which strengthens Theorem 1121 bv replacing 
the requirement 6q ^ by a simple test on VCq- We now characterize the 
connectedness of VCq- 

Definition 20 

Let B = (V, /, s, t) be a coloured base. We say that a set of edges E for B is well- 
coloured (for B) if for each partition p = {V\, V 2 ) of V with f(Vi) n f(V 2 ) = 0, 
there is an edge {v%,V2} € E with V\ G V± and v 2 € V%- ■ 

We call G = B(Ei, E%) £ Q well- coloured if E\ is well-coloured for B. 
Lemma 21 

Let G € Q. Then VCq is a connected graph iff G is well-coloured. 
Proof 

Let G = B(Ei,E2) with B = (V, f, s, t). We first prove the forward implication. 
Let G be not well-coloured. Then there is a partition p = (Vi,V2) of V with 
f(Vi) n f(Vi) = such that for each e £ Ei, either e C V\ or e C V 2 . Since 
for each {wi,f2} € E 2 we have /(^i) = /(wa), we have either {wi,W2} C Vi 
or {wi,W2} C V%. Therefore V\ and V2 induce two non-empty sets of connected 
components which have no vertex label in common. Therefore, VCq is not a 
connected graph. 

We now prove the reverse implication. Assume that VCq = (C B, e) is not a 
connected graph. Then, by the definition of pointer-component graph, there is 
a partition (Ci, C2) of £ such that Ci and C2 have no vertex label in common. 
Let Vi be the set of vertices of the connected components in C, (i S {1,2}). 
Then for partition p = (V%, V2) of V we have /(Vi) Pi /(V2) = and for each 
e <E Ei U E2, either e C Vi or e C V 2 . Therefore G is not well-coloured. ■ 
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Clearly, if G = B(Ei, E2) G Q is well-coloured and E is desirable for B (e.g., one 
could take E G u>g), then H = B(Ei,E) G Q and H is well-coloured. Therefore, 
by Lemma VCq is a connected graph iff VCh is a connected graph. 

By Theorem[l2]thc next result is essential to efficiently determine which abstract 
reduction graphs are isomorphic to reduction graphs. 

Theorem 22 

Let G G Q. Then VCq is a connected graph iff 6q ^ 0. 
Proof 

Let G = B(E\,E2)- We first prove the forward implication. Let VCq be a 
connected graph and let E G loq. Then VCh with H = B(E\,E) is a con- 
nected graph. Thus there exists a D C dom(G) such that VCh\d is a tree. By 
Lemma HH B(Ei, flip D (i?)) is a connected graph, and consequently ffip £) (i5) 6 

We now prove the reverse implication. Let E G Qq. Thus, H = B(Ei,E) is a 
connected graph, and hence VCh is a connected graph. Therefore, VCq is also 
a connected graph. ■ 

We can summarize the last two results as follows. 
Corollary 23 

Let G G Q. Then the following conditions are equivalent: 

1. G is well-coloured, 

2. VCq is a connected graph, and 

3. 9 G ± 0. 
Example 14 

By Figure[3]and Corollary l23l for (abstract) reduction graph G\ in Figure[2]we 
have #Gi 7^ 0- On the other hand, by Figure [T3l and Corollary |2"3"1 for abstract 
reduction graph G2 in Figure U we have 8g 2 = 0- 

By Corollary l23l and Theorem [12] we obtain the first main result of this paper. 
It shows that one needs to check only a few computationally easy conditions to 
determine whether or not a 2-edge coloured graph is (isomorphic to) a reduction 
graph. Surprisingly, the 'high-level' notion of pointer-component graph is crucial 
in this characterization. 
Theorem 24 

Let G be a 2-edge coloured graph. Then G isomorphic to a reduction graph iff 
G G Q and VCq is a connected graph. 

Note that in the previous theorem we can equally well replace "VCq is a con- 
nected graph" by one of the other equivalent conditions in Corollary [2^1 

In Theorem 21 in [3] it is shown that the pointer-component graph of each 
reduction graph is a connected graph. We did not use that result here - in fact 
it is now a direct consequence of Theorem [M] 

Not only is it computationally efficient to determine whether or not a 2-edge 
coloured graph G is isomorphic to a reduction graph, but, when this is the case, 
then it is also computationally easy to determine a legal string u for which 
G w 1Z U . Indeed, we can determine such a u from G = B(Ei, E2) as follows: 
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1. Determine a £ e ug. As we have mentioned before, such a E is easily 
obtained. 

2. Compute VCh with H = B(E\,E), and determine a set of edges D such 
that VCh\d is a tree. 

3. Compute G' = B{E\, E 2 , ftip D (E)), and determine awe Lql 

As a consequence, pointer-component graphs of legal strings can, surprisingly, 
take all imaginable forms. 

Corollary 25 

Every connected multigraph G = (V, E, e) with E C A is isomorphic to a 
pointer-component graph of a legal string. 



10 Flip and the Underlying Legal String 

We now move to the second part of this paper, where we characterize the fibers 
Tl^CIlu) modulo graph isomorphism. First we consider the effect of flip oper- 
ations on the set of merge edges. 

Lemma 26 

Let u be a legal string and let p G dom(u). If p is negative in u, then flip p (M M ) G 
9 U . If p is positive in u, then fhp p (M u ) ^ U . In other words, flip p (M u ) e 9 U iff 
p is negative in u. 

Proof 

Let 7Z U = B(Ei,E 2 ). By the definition of flip p , flip p (M M ) g oj u . It suffices to 
prove that G = B(Ei,Rip p (M u j) is a connected graph when p is negative in u 
and not a connected graph when p is positive in u. Graph B(Ei,M u ) has the 
following form: 

Now if p is negative in u, then G has the following form: 

\ \ / / 

\ \ / / 

\ X / X 

Thus in this case G is connected. 

If p is positive in u, then G has the following form: 



= Pl Pl : 



Thus in this case G is not connected. ■ 
Lemma 27 

Let u be a legal string and let p,q E dom(u). If p and q are overlapping in u 
and not both negative in u, then flip| p (J |(M u ) G U . 
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Proof 

Let 1Z U = B(Ei, E2). Then B(E\, M u ) has the following form (we can assume 
without loss of generality that p appears before q in the path from s to t): 



Assume that p is positive in u - the other case {q is positive in u) is proved 
similarly. By the proof of Lemma it follows that B(E\, flip (M u )) has the 
following form: 



Therefore, q G bridge(B(£'i, Qip p (M u ))). By Thcorcm[T51 the pointer-component 
graph of B(Ei,&ipr p q \(M u )) has only one vertex. Consequently, B(E\, Qips pq ^ {M u 
is connected and thus flip r pq \ (M„) G 9 U . m 

Lemma 28 

Let u be a legal string, and let D C dom(it) be nonempty. If flip D (M u ) G 9 U , 
then either there is a p G D negative in u or there are p,q G D positive and 
overlapping in u. 

Proof 

Let £ u = B(E 1 ,E 2 ,M U ) and let nip D (M u ) G 6 U . Then B(E 1 ,Q.ip D (M u )) is a 
connected graph. Assume to the contrary that all elements in D are positive and 
pairwise non-overlapping in u. Then there is a p G D such that the domain of 
the p-interval does not contain an element in D\{p}. By the proof of Lemma 1^1 
B(Ei, flip (M u )) consist of two connected components, one of which does not 
have vertices labelled by elements in D\{p}. Therefore B(Ex ) Qip D (M u )) also 
contains this connected component, and thus B(E\, Hip D (M u )) has more than 
one connected component - a contradiction. ■ 

By the previous lemmata, we have the following result. 
Theorem 29 

Let u be a legal string, and let D C dom(u) be nonempty. If flip D (M ll ) G U , 
then either there is a p G D negative in u with flip_(M u ) G 9 U or there are 
p, q G D positive and overlapping in u with fhp| p g } (M u ) G 6 U . 



11 Dual String Rules 

We now define the dual string rules. These rules will be used to characterize 
the effect of flip operations on the underlying legal string. For all p, q G n with 
p^qwe define 

• the dual string positive rule for p is defined by dspr p (uipu2pu3) = u\pu2pu^ 1 

• the dual string double rule forp, q is defined by Asdr p ^ q {uipu2qu^pu4 L quc,) — 
Uipu4qu 3 pu 2 qu 5 , 
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where u\, u 2 , ■ ■ ■ , U5 are arbitrary (possibly empty) strings over IT. Notice that 
the dual string rules are self-inverse. Also notice the strong similarities between 
dspr and spr, and between dsdr and sdr. Both dspr p and spr p invert the 
substring between the two occurrences of p or p. However, dspr p is applicable 
when p is negative, while spr p is applicable when p is positive. Also, spr p 
removes the occurrences of p and p, while dspr does not. A similar comparison 
can be made between dsdr and sdr. 

The domain of (sequences of) dual string rules is defined similarly as for string 
rules. Thus, e.g., dom(dsdr P:g ) = {p, q}. 

Definition 30 

Let u and v be legal strings. We say that u and v are dual, denoted by ~d if 
there is a (possibly empty) sequence p of dual string rules applicable to u such 
that p>{u) ~ v. m 

Notice that is an equivalence relation. Clearly, «d is reflexive. It is sym- 
metrical since dual string rules are self-inverse, and it is transitive by function 
composition: if p>i(u) w v and p> 2 (v) ~ w, then (ip 2 Pi)(u) ~ w. 

Since dspr p is applicable when p is negative in u and dsdr p g is applicable when 
p and q are positive and overlapping, the following result is a direct corollary to 
Lemma |2"51 

Corollary 31 

Let u be a legal string, and let D C dom(w) be nonempty. If Qip D (M u ) 6 6 U , 
then there is a dual string rule p with dom(p) C D applicable to u. 

Let ip — p n ■ ■ ■ p\ with each pi (for 1 < i < n) a dual string rule. We define 
odom((,c) = ©Kj<„ dom(pi). Thus, odom(cp) C dom(p). We call p reduced if 
dom(pi) n dom(pj) = for all 1 < i < j < n. Note that if p is reduced, then 
doTa(p) = odom(<^>). 

Let G = B(Ei, E2, E3) be an extended abstract reduction graph, and let D C 
dom(G). Then we define flip D (G) = B{E U E 2 , flip G /,r, (E 3 )), where G = B{E 1 ,E 2 ). 

Lemma 32 

Let u be a legal string, and let cp be a sequence of dual string rules applicable 
to u. Then £ v r u ) ~ ^Pd{^u) with D = odom(i^). Consequently, H v ( u ) ~ 1Z U . 



It suffices to prove the result for the case p = dspr p with p G II and for the case 
p = dsdr p g with p, q E II. We first prove the case where p — dspr p for some 
p G II is applicable to u. Then by the second figure in the proof of Lemma 1^1 
we see that the inversion of the substring between the two occurrences of p in u 
accomplished by <p> faithfully simulates the corresponding effect of flip p on £ u . 
We only need to verify that p is negative in flip p (£ u ). To do this, we depict £ u 
such that the vertices are represented by their identity instead of their label: 

S = ■ ■ ■ = Vl V 2 = ■ ■ ■ = V 3 Vi = ■ ■ ■ = t 



Proof 
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where the vertices Vi, i £ {1, 2, 3, 4}, are labelled by p. Then flip (£ u ) is 
s = ■ ■ • = vi v 3 • ■ ■ v 2 v A = ■ ■ ■ = t 



Therefore p is indeed negative in fhp p (£„), and consequently £ v ( u ) ~ flip p (£u) 

We now prove the case where <p = dsdr P)? withp, q £ II. Let £ u — B(E\, E 2 , £3) 
then £ u has the following form 



where we omitted the edges in E 2 . Since p and q are positive in u, fhp{ p q ^(£ u ) 
has the following form: 



where we again omitted the edges in E 2 . Thus, we see that interchanging the 
substring in u between p and q and the substring in u between p and q accom- 
plished by ip faithfully simulates the corresponding effect of flippy on £ u . We 
only need to verify that both p and q are positive in fhp p q (£ u )- To do this, we 
depict £ u such that the vertices are represented by their identity instead of their 
label: 




where the vertices Vi and Wi, i £ {1, 2, 3, 4}, are labelled by p and q, respectively. 
Then flip Pig (£„) is 




Therefore both p and q are indeed positive in flip pg (£' u ), and consequently 
£ip(u) ~ fiip P ,q(£u)- ■ 

Thus, if ip 1 and ip 2 are sequences of dual string rules applicable to a legal string 
u with odom((/3i) = odomf^), then £ Vl t u ) ~ £u> 2 M an d thus fi(u) w tp 2 (u). 

Lemma 33 

Let u be a legal string, and let D C dom(w). There is a reduced sequence ip of 
dual string rules applicable to u such that dom(yj) — D iff &ip D (M u ) £ 9 U . 

Proof 

The forward implication follows directly from Lemma 1321 We now prove the 
reverse implication. If D = 0, we have nothing to prove. Let D 0. By 
Corollary HU there is a dual string rule p\ with D\ — dom(pi) C D applicable 



to u. By Lemma [ 



•7>i(«) 



flipjj ,{£u) and D± = odom(pi) = dom(pi). Thus, 



Sip_D\Di (M pi ( u ) ) £ pi ( u ) ■ Now by iteration, there is a reduced sequence tp of 
dual string rules applicable to u such that odom(</?) = dom(c/j) = D. 
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It follows from Lemma [33] and Lemma [33J that reduced sequences of dual string 
rules are a normal form of sequences of dual string rules. Indeed, by Lemma [321 
if ip is a sequence of dual string rules applicable to a legal string u with D = 
odom^), then flip D (M u ) G 9 U . By Lemma [33] there is a reduced sequence f' 
of dual string rules applicable to u such that dom(<^') = odom(y>') = D. By the 
paragraph below Lemma [321 we have <f{u) ~ f'{u). 

We are now ready to prove the second (and final) main result of this paper. It 
shows that TZ~ l {JZ u ) (modulo graph isomorphism) is the 'orbit' of u under the 
dual string rules. That is, the legal strings obtained from u by applying dual 
string rules are exactly those legal strings to have the same reduction graph as 
u (up to isomorphism). 

Theorem 34 

Let u and v be legal strings. Then u «d v iff 1Z U s=s 1Z V . 
Proof 

The forward implication follows directly from Lemma 1321 We now prove the 
reverse implication. Let 1Z U « 1Z V . By Corollary [IT] there is a E G 9 U such that 
£ v « B{E ll E 2 ,E) with K u = B(E 1 ,E 2 ). By Theorem [T5l E = Qip D (M u ) for 
some D C dom(u). Since &ip D (M u ) G 8 U , by Lemma [331 there is a reduced 
sequence f of dual string rules applicable to u such that dom((^) = D. Now by 
Lemma 132"] £ v ( u ) ~ Qip D (£ u ) ~ £ v , and therefore, by Theorem 1 101 f{u) w v. m 

12 Discussion 

This paper characterizes, having 1Z as the function which assigns to each legal 
string u its reduction graph 1Z U , the range of 1Z (Theorem [24]) and each fiber 
Tl^iT^u) modulo graph isomorphism (Theorem [3JJ . 

The first characterization corresponds to a computationally efficient algorithm 
that determines whether or not a graph G is isomorphic to a reduction graph. 
Moreover, if this is the case, then the algorithm given below Theorem [24] al- 
lows for an efficient determination of a legal string u such that G « 1Z U . The 
first characterization relies on the notion of merge-legal edges and its flip oper- 
ation introduced in this paper. In particular, the connected components in the 
subgraph induced by the reality edges and the merge-legal edges and the flip 
operation turns out to be relevant in this context. 

The second characterization determines, given u, the whole set 1Z~ 1 {1Z U ) modulo 
graph isomorphism. From a biological point of view, the fibers characterize which 
micronuclear genes obtain the same macronuclear structure. It turns out that 
7?. _1 (7?. ll ) is the orbit of u under the dual string rules. Surprisingly, these two 
types of string rewriting rules are very similar to the string positive rules and 
the string double rules that are used to define the model. Moreover, each two 
legal strings u and v in such a fiber can be transformed into each other by a 
sequence tp of string rewriting rules without using a pointer more than once. 
Therefore, the number of string rewriting rules in f can be bounded by the size 
of the domain of u (and v). 

The reduction graph of a legal string u in a certain sense retains all information 
regarding applicability of string negative rules in each successful reduction of 
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u, while discarding almost all other information regarding the rules applied in 
successful reductions, see [3]- Therefore, the fiber in a sense characterizes all 
legal strings that have the same properties regarding the application of string 
negative rules. In biological terms, this may allow for a way to determine whether 
or not the strategies regarding the string negative rule are different among the 
different kinds of (genes in) ciliates. 
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